181 GGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGC
CCTGGGCCGCCGAAAGGCGCGCGACCACCGGGTCACGGACCACACGCACGGGACCCTGCG 

NFkB_CS1
GGGRQTYYQC 
NFkB-MHC-I.2
TGGGCTTCCCC 

241 ACGGCCGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCG 
TGCCGGCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGC 



301 GCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTC
CGACCCCAACTCCCGCCGGCCCCCCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAG 

NFkB_CS1
GGGRQTYYQC 
NFkB_CS2 
RGGGRMTYYCC 

Topo_II_cleavage_site
RNYNNCNNGYNGKTNYNY 

361 AGGGCGCTTCCCCCGCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGG
TCCCGCGAAGGGGGCGTCCACAGGACGGACTTCCTCGACCACCGGGCTCACGACGTCTCC 



FIG. 12 
1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT

51 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA 

101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 

151 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA 

201 TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA TTATAAAGAT

251 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG

301 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT
351 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT
401 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA
451 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA
501 CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT TAGAGCAATG
551 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA
601 GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG 
651 ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 
701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA
751 GAATGAGAAA GATCACTTTC TCAACAACAT CAACGTGCCG AATTGGAATA 
801 ATATGAAATC AAGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT 
851 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 
901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 
951 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG

1001 CTTGAGAAAG TCAAAGATTT TAACTTCAAC TACTATTTAA CAAAATCTTG

1051 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 

1101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC 

1151 TACACAACTG ATAATAAATG CGTCACACAA TTTATTAATG AATTTTTCTA 

1201 CAATATACTC CCCAAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC
1251 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC 

1301 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA

1351 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT

1401 ACGTCTTATG GAAATTGCTC CGATGGATAT TCGAGGATCT CGTCGTCTCG
1451 CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA 
1501 AACCTATTAC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA
1551 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 
1601 GAAGAATGGA AAAAGTCGCT TGGATTTGCA CCTGGAAAAC TCAGACTAAT 
1651 ACCGAAGAAA ACTACTTTCC GTCCAATTAT GACTTTCAAT AAGAAGATTG 
1701 TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG
1751 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC 
1801 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG
1851 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 
1901 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 
1951 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA
2001 CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC
2051 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 
2101 TGCACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG 
2151 AACAAAATGA CTTAAATGCA AAGAAAACAT TAATTGTTGA AGCAAAGCAA 
2201 AGAAATTATT TTAAGAAAGA TAACTTACTT CAACCAGTCA TTAATATTTG 
2251 CCAATATAAT TACATTAACT TTAATGGGAA GTTTTATAAA CAAACAAAAG 
2301 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT
2351 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA



FIG. 13 
2401 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC

2451 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 

2501 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 

2551 GACTAGTTTT CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA

2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC

2651 TCAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT

2701 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT

2751 CAATGTGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC

2801 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT

2851 CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGCCAAAG 

2901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC

2951 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA

3001 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG
3051 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA
3101 AAGTACATTT TCAACAGAGT TTGCATGATC CTCAAGGCAA AAGAAGCAAA 
3151 GCTAAAAAGT GACCAATGTC AATCTCTAAT TCAATATGAT GCATAGTCGA 
3201 CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG TCTTATATAC 
3251 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG



FIG. 13 

(CONTINUED) 



1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RNQSQSHYKD 

51 LEDIKIFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL 

101 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 

151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 

201 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTHFNRN
251 NQFFKKHEFV SNKMNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKIAYM 

301 LEKVKDFNFN YYLTKSCPLP ENWRERKQKI ENLINKTREE KSKYYEELFS
351 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELIH
401 KNLLLEKINT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWIFEDLVVS
451 LIRCFFYVTE QQKSYSKTYY YRKNIWDVIM KMSIADLKKE TLAEVQEKEV 
501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL
551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA 
601 TMDIEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNNIVIDSKN 
651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTLIVEAKQ
701 RNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 
751 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 
801 INVSRENGFK FMMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI
851 SIDMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 
901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 
951 LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK

1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A



FIG. 14 
20734

1 gcagcgctgc gtcctgctgc gcacgtggga agccctggcc ccggccaccc ccgcgatgcc 
61 gcgcgctccc cgctgccgag ccgtgcgctc cctgctgcgc agccactacc gcgaggtgct 
121 gccgctggcc acgttcgtgc ggcgcctggg gccccagggc tggcggctgg tgcagcgcgg 
181 ggacccggcg gctttccgcg cgctggtggc ccagtgcctg gtgtgcgtgc cctgggacgc 
241 acggccgccc cccgccgccc cctccttccg ccaggtgtcc tgcctgaagg agctggtggc 
301 ccgagtgctg cagaggctgt gcgagcgcgg cgcgaagaac gtgctggcct tcggcttcgc 
361 gctgctggac ggggcccgcg ggggcccccc cgaggccttc accaccagcg tgcgcagcta
421 cctgcccaac acggtgaccg acgcactgcg ggggagcggg gcgtgggggc tgctgctgcg 
481 ccgcgtgggc gacgacgtgc tggttcacct gctggcacgc tgcgcgctct ttgtgctggt 
541 ggctcccagc tgcgcctacc aggtgtgcgg gccgccgctg taccagctcg gcgctgccac 
601 tcaggcccgg cccccgccac acgctagtgg accccgaagg cgtctgggat gcgaacgggc 
661 ctggaaccat agcgtcaggg aggccggggt ccccctgggc ctgccagccc cgggtgcgag 
721 gaggcgcggg ggcagtgcca gccgaagtct gccgttgccc aagaggccca ggcgtggcgc 
781 tgcccctgag ccggagcgga cgcccgttgg gcaggggtcc tgggcccacc cgggcaggac
841 gcgtggaccg agtgaccgtg gtttctgtgt ggtgtcacct gccagacccg ccgaagaagc 
901 cacctctttg gagggtgcgc tctctggcac gcgccactcc cacccatccg tgggccgcca
961 gcaccacgcg ggccccccat ccacatcgcg gccaccacgt ccctgggaca cgccttgtcc
1021 cccggtgtac gccgagacca agcacttcct ctactcctca ggcgacaagg agcagctgcg 
1081 gccctccttc ctactcagct ctctgaggcc cagcctgact ggcgctcgga ggctcgtgga
1141 gaccatcttt ctgggttcca ggccctggat gccagggact ccccgcaggt tgccccgcct 
1201 gccccagcgc tactggcaaa tgcggcccct gtttctggag ctgcttggga accacgcgca
1261 gtgcccctac ggggtgctcc tcaagacgca ctgcccgctg cgagctgcgg tcaccccagc
1321 agccggtgtc tgtgcccggg agaagcccca gggctctgtg gcggcccccg aggaggagga 
1381 cacagacccc cgtcgcctgg tgcagctgct ccgccagcac agcagcccct ggcaggtgta 
1441 cggcttcgtg cgggcctgcc tgcgccggct ggtgccccca ggcctctggg gctccaggca 
1501 caacgaacgc cgcttcctca ggaacaccaa gaagttcatc tccctgggga agcatgccaa
1561 gctctcgctg caggagctga cgtggaagat gagcgtgcgg gactgcgctt ggctgcgcag
1621 gagcccaggg gttggctgtg ttccggccgc agagcaccgt ctgcgtgagg agatcctggc 
1681 caagttcctg cactggctga tgagtgtgta cgtcgtcgag ctgctcaggt ctttctttta 

1741 tgtcacggag accacgtttc aaaagaacag gctctttttc taccggaaga gtgtctggag
1801 caagttgcaa agcattggaa tcagacagca cttgaagagg gtgcagctgc gggagctgtc 

1861 ggaagcagag gtcaggcagc atcgggaagc caggcccgcc ctgctgacgt ccagactccg

1921 cttcatcccc aagcctgacg ggctgcggcc gattgtgaac atggactacg tcgtgggagc
1981 cagaacgttc cgcagagaaa agagggccga gcgtctcacc tcgagggtga aggcactgtt 
2041 cagcgtgctc aactacgagc gggcgcggcg ccccggcctc ctgggcgcct ctgtgctggg 
2101 cctggacgat atccacaggg cctggcgcac cttcgtgctg cgtgtgcggg cccaggaccc 
2161 gccgcctgag ctgtactttg tcaaggtgga tgtgacgggc gcgtacgaca ccatccccca 
2221 ggacaggctc acggaggtca tcgccagcat catcaaaccc cagaacacgt actgcgtgcg 
2281 tcggtatgcc gtggtccaga aggccgccca tgggcacgtc cgcaaggcct tcaagagcca 
2341 cgtctctacc ttgacagacc tccagccgta catgcgacag ttcgtggctc acctgcagga 
2401 gaccagcccg ctgagggatg ccgtcgtcat cgagcagagc tcctccctga atgaggccag
2461 cagtggcctc ttcgacgtct tcctacgctt catgtgccac cacgccgtgc gcatcagggg 
2521 caagtcctac gtccagtgcc aggggatccc gcagggctcc atcctctcca cgctgctctg
2581 cagcctgtgc tacggcgaca tggagaacaa gctgtttgcg gggattcggc gggacgggct 
2641 gctcctgcgt ttggtggatg atttcttgtt ggtgacacct cacctcaccc acgcgaaaac

2701 cttcctcagg accctggtcc gaggtgtccc tgagtatggc tgcgtggtga acttgcggaa
2761 gacagtggtg aacttccctg tagaagacga ggccctgggt ggcacggctt ttgttcagat
2821 gccggcccac ggcctattcc cctggtgcgg cctgctgctg gatacccgga ccctggaggt 
2881 gcagagcgac tactccagct atgcccggac ctccatcaga gccagtctca ccttcaaccg 
2941 cggcttcaag gctgggagga acatgcgtcg caaactcttt ggggtcttgc ggctgaagtg 

3001 tcacagcctg tttctggatt tgcaggtgaa cagcctccag acggtgtgca ccaacatcta
3061 caagatcctc ctgctgcagg cgtacaggtt tcacgcatgt gtgctgcagc tcccatttca
3121 tcagcaagtt tggaagaacc ccacattttt cctgcgcgtc atctctgaca cggcctccct 
3181 ctgctactcc atcctgaaag ccaagaacgc agggatgtcg ctgggggcca agggcgccgc 
3241 cggccctctg ccctccgagg ccgtgcagtg gctgtgccac caagcattcc tgctcaagct
3301 gactcgacac cgtgtcacct acgtgccact cctggggtca ctcaggacag cccagacgca 
3361 gctgagtcgg aagctcccgg ggacgacgct gactgccctg gaggccgcag ccaacccggc 
3421 actgccctca gacttcaaga ccatcctgga ctgatggcca cccgcccaca gccaggccga
3481 gagcagacac cagcagccct gtcacgccgg gctctacgtc ccagggaggg aggggcggcc 
3541 cacacccagg cccgcaccgc tgggagtctg aggcctgagt gagtgtttgg ccgaggcctg
3601 catgtccggc tgaaggctga gtgtccggct gaggcctgag cgagtgtcca gccaagggct
3661 gagtgtccag cacacctgcc gtcttcactt ccccacaggc tggcgctcgg ctccacccca 
3721 gggccagctt ttcctcacca ggagcccggc ttccactccc cacataggaa tagtccatcc 
3781 ccagattcgc cattgttcac ccctcgccct gccctccttt gccttccacc cccaccatcc 
3841 aggtggagac cctgagaagg accctgggag ctctgggaat ttggagtgac caaaggtgtg
3901 ccctgtacac aggcgaggac cctgcacctg gatgggggtc cctgtgggtc aaattggggg
3961 gaggtgctgt gggagtaaaa tactgaatat atgagttttt cagttttgaa aaaaa 
MPRAPRCRAVRSLLRSHYREVLPLATFVRRLGPQGWRLVQRGDP
AAFRALVAQCLVCVPWDARPPPAAPSFRQVSCLKELVARVLQRL 
CERGAKNVLAFGFALLDGARGGPPEAFTTSVRSYLPNTVTDALR 
GSGAWGLLLRRVGDDVLVHLLARCALFVLVAPSCAYQVCGPPLY 
QLGAATQARPPPHASGPRRRLGCERAWNHSVREAGVPLGLPAPG
ARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRG 
PSDRGFCVVSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPP
STSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRP 
SLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLEL 
LGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEE 
EDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNE 
RRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGC
VPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNR
LFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPAL 
LTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKA
LFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPP 
ELYFVKVDVTGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQ
KAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVI
EQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSI 
LSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHA 
KTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPA
HGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGR 
NMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRF 
HACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSL 
GAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQ 
TQLSRKLPGTTLTALEAAANPALPSDFKTILD 

FIG. 17 

GGCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTT 
TTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTG 
GAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCT 
GTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACT 
CCGCTTCATCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGG 
AGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACT 
GTTCAGCGTGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCT 
GGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGA 
CCCGCCGCCTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCC 
CCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGT 
GCGTCGGTATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAG 
CCACGTCCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCT 
GCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGC 
TGCTCCTGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAA 
CCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGA 

TGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGG 
TGCAGAGCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACC 
GCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGT 
GTCACAGCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCT 
ACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTC 
ATCAGCAAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCC 
TCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCG 
CCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGC
TGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGC 
AGCTGAGTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGG 
CACTGCCCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCG 
AGAGCAGACACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGC 
CCACACCCAGGCCTGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCT 
GCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGC 
TGAGTGTCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCC 
AGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATC 
CCCAGATTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATC 
CAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGT 
GCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGG 
GGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGAAAAAAAAAAA
AAAAAAAAAAAAAAAA 

FIG. 18 
MetSerValTyrValValGluLeuLeuArgSerPhePhe
TyrValThrGluThrThrPheGlnLysAsnArgLeuPhe
PheTyrArgLysSerValTrpSerLysLeuGlnSerIle
GlyIleArgGlnHisLeuLysArgValGlnLeuArgGlu
LeuSerGluAlaGluValArgGlnHisArgGluAlaArg 
ProAlaLeuLeuThrSerArgLeuArgPheIleProLys
ProAspGlyLeuArgProIleValAsnMetAspTyrVal
ValGlyAlaArgThrPheArgArgGluLysArgAlaGlu 
ArgLeuThrSerArgValLysAlaLeuPheSerValLeu 
AsnTyrGluArgAlaArgArgProGlyLeuLeuGlyAla 
SerValLeuGlyLeuAspAspIleHisArgAlaTrpArg
ThrPheValLeuArgValArgAlaGlnAspProProPro 
GluLeuTyrPheValLysValAspValThrGlyAlaTyr 
AspThrIleProGlnAspArgLeuThrGluValIleAla
SerIleIleLysProGlnAsnThrTyrCysValArgArg
TyrAlaValValGlnLysAlaAlaHisGlyHisValArg
LysAlaPheLysSerHisValLeuArgProValProGly 
AspProAlaGlyLeuHisProLeuHisAlaAlaLeuGln 
ProValLeuArgArgHisGlyGluGlnAlaValCysGly 
AspSerAlaGlyArgAlaAlaProAlaPheGlyGly 



FIG. 19 



1

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 



gly pro gln gly trp arg leu val gln arg gly asp pro ala ala

GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 60 

phe arg ala leu val ala gln cys leu val cys val pro trp asp

TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg pro pro pro ala ala pro ser phe arg gln val ser cys

GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 

leu lys glu leu val ala arg val leu gln arg leu cys glu arg

CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 

GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 

ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 

GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 



FIG. 20 
130

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 

TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 

140 150 

trp gly leu leu leu arg arg val gly asp asp val leu val his 

TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC 

160 

leu leu ala arg cys ala leu phe val leu val ala pro ser cys 

CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC 

170 180 

ala tyr gln val cys gly pro pro leu tyr gln leu gly ala ala

GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC 

190 

thr gln ala arg pro pro pro his ala ser gly pro arg arg arg

ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT 

200 210 

leu gly cys glu arg ala trp asn his ser val arg glu ala gly 

CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG GCC GGG

220 

val pro leu gly leu pro ala pro gly ala arg arg arg gly gly 

GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG GGC 

230 240 

ser ala ser arg ser leu pro leu pro lys arg pro arg arg gly 

AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC 

250 

ala ala pro glu pro glu arg thr pro val gly gln gly ser trp

GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG 

260 270 

ala his pro gly arg thr arg gly pro ser asp arg gly phe cys 

GCC CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT 



280 

val val ser pro ala arg pro ala glu glu ala thr ser leu glu 
GTG GTG TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG 

290 300 
gly ala leu ser gly thr arg his ser his pro ser val gly arg 
GGT GCG CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC 

310 

gln his his ala gly pro pro ser thr ser arg pro pro arg pro
CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC 

320 330 
trp asp thr pro cys pro pro val tyr ala glu thr lys his phe 
TGG GAC ACG CCT TGT CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC 



FIG. 20 

(CONTINUED) 
340

leu tyr ser ser gly asp lys glu gln leu arg pro ser phe leu
CTC TAC TCC TCA GGC GAC AAG GAG CAG CTG CGG CCC TCC TTC CTA 

350 360 
leu ser ser leu arg pro ser leu thr gly ala arg arg leu val 
CTC AGC TCT CTG AGG CCC AGC CTG ACT GGC GCT CGG AGG CTC GTG 

370 

glu thr ile phe leu gly ser arg pro trp met pro gly thr pro 
GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC 

380 390 
arg arg leu pro arg leu pro gln arg tyr trp gln met arg pro
CGC AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC 

400 

leu phe leu glu leu leu gly asn his ala gln cys pro tyr gly
CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG CAG TGC CCC TAC GGG 

410 420 
val leu leu lys thr his cys pro leu arg ala ala val thr pro
GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA 

430 

ala ala gly val cys ala arg glu lys pro gln gly ser val ala
GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC TCT GTG GCG 

440 450 
ala pro glu glu glu asp thr asp pro arg arg leu val gln leu
GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG CAG CTG 

460 

leu arg gln his ser ser pro trp gln val tyr gly phe val arg
CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG CGG 

470 480 
ala cys leu arg arg leu val pro pro gly leu trp gly ser arg 
GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 

490 

his asn glu arg arg phe leu arg asn thr lys lys phe ile ser 
CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC 

500 510 
leu gly lys his ala lys leu ser leu gln glu leu thr trp lys
CTG GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG 

520 

met ser val arg asp cys ala trp leu arg arg ser pro gly val 
ATG AGC GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT 

530 540 
gly cys val pro ala ala glu his arg leu arg glu glu ile leu 
GGC TGT GTT CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG 
550

ala lys phe leu his trp leu met ser val tyr val val glu leu
GCC AAG TTC CTG CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG

560 570
leu arg ser phe phe tyr val thr glu thr thr phe gln lys asn
CTC AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC

580

arg leu phe phe tyr arg lys ser val trp ser lys leu gln ser
AGG CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC

590 600
ile gly ile arg gln his leu lys arg val gln leu arg glu leu
ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG



610 

ser glu ala glu val arg gln his arg glu ala arg pro ala leu
TCG GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG 



620 630 
leu thr ser arg leu arg phe ile pro lys pro asp gly leu arg 
CTG ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG



640 

pro ile val asn met asp tyr val val gly ala arg thr phe arg 
CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC 



650 660 
arg glu lys arg ala glu arg leu thr ser arg val lys ala leu 
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 



670 

phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu 
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG 



680 690 
gly ala ser val leu gly leu asp asp ile his arg ala trp arg 
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 



700 

thr phe val leu arg val arg ala gln asp pro pro pro glu leu
ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG 



710 720 
tyr phe val lys val asp val thr gly ala tyr asp thr ile pro 
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 



730

gln asp arg leu thr glu val ile ala ser ile ile lys pro gln
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG

740 750
asn thr tyr cys val arg arg tyr ala val val gln lys ala ala
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC
760

his gly his val arg lys ala phe lys ser his val leu arg pro 
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC CTA CGT CCA 

770 780 
val pro gly asp pro ala gly leu his pro leu his ala ala leu 
GTG CCA GGG GAT CCC GCA GGG CTC CAT CCT CTC CAC GCT GCT CTG 

790 

gln pro val leu arg arg his gly glu gln ala val cys gly asp
CAG CCT GTG CTA CGG CGA CAT GGA GAA CAA GCT GTT TGC GGG GAT 

800 807 
ser ala gly arg ala ala pro ala phe gly gly OP 

TCG GCG GGA CGG GCT GCT CCT GCG TTT GGT GGA TGA TTTCTTGTTGGT 
GACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGA 
GTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCCTGTAGAAGACGAGGC 
CCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCT 
GCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAGCTATGCCCGGACCTC 
CATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAA 
ACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGATTTGCAGGTGAACAG 
CCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCA 
CGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAACCCCACATTTTTCCT 
GCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGG 
GATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCT 
GTGCCACCAAGCATTCCTGCTCAAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCT 
GGGGTCACTCAGGACAGCCCAGACGCAGCTGAGTCGGAAGCTCCCGGGGACGACGCTGAC 
TGCCCTGGAGGCCGCAGCCAACCCGGCACTGCCCTCAGACTTCAAGACCATCCTGGACTG 
ATGGCCACCCGCCCACAGCCAGGCCGAGAGCAGACACCAGCAGCCCTGTCACGCCGGGCT 
CTACGTCCCAGGGAGGGAGGGGCGGCCCACACCCAGGCCCGCACCGCTGGGAGTCTGAGG 
CCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAG 
GCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCCGTCTTCACTTCCC 
CACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTC 
CACTCCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCCTCGCCCTGCC 
CTCCTTTGCCTTCCACCCCCACCATCCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTC 
TGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGAT
GGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATG 
AGTTTTTCAGTTTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
3601 ATCGATTGGGCCCGAGATCTCGCGCGCGAGGCCTGCCATGGGACCCACTGCAGGGGCAGC
TAGCTAACCCGGGCTCTAGAGCGCGCGCTCCGGACGGTACCCTGGGTGACGTCCCCGTCG 

3615 3636 
BGL2 NCO1

3661 TGGGANGCTGCAGGCTTCAGGTCCCAGTGGGGTTGCCATCTGCCAGTAGAAACCTGATGT
ACCCTNCGACGTCCGAAGTCCAGGGTCACCCCAACGGTAGACGGTCATCTTTGGACTACA 

3721 AGAATCAGGGCGCGAGTGTGGACACTGTCCTGAATCTCAATGTCTCAGTGTGTGCTGAAA 
TCTTAGTCCCGCGCTCACACCTGTGACAGGACTTAGAGTTACAGAGTCACACACGACTTT 

3781 CATGTAGAAATTAAAGTCCATCCCTCCTACTCTACTGGGATTGAGCCCCTTCCCTATCCC
GTACATCTTTAATTTCAGGTAGGGAGGATGAGATGACCCTAACTCGGGGAAGGGATAGGG 

3841 CCCCCAGGGGCAGAGGAGTTCCTCTCACTCCTGTGGAGGAAGGAATGATACTTTGTTATT
GGGGGTCCCCGTCTCCTCAAGGAGAGTGAGGACACCTCCTTCCTTACTATGAAACAATAA 



3901 TTTCACTGCTGGTACTGAATCCACTGTTTCATTTGTTGGTTTGTTTGTTTTGTTTTGAGA
AAAGTGACGACCATGACTTAGGTGACAAAGTAAACAACCAAACAAACAAAACAAAACTCT 



3961 AGCGGTTTCACTCTTGTTGCTCAGGCTGGANGGAGTGCAATGGCGCGATCTTGGCTTACT
TCGCCAAAGTGAGAACAACGAGTCCGACCTNCCTCACGTTACCGCGCTAGAACCGAATGA 



4021 GCAGCCTCTGCCTCCCAGGTTCAAGTGATTCTCCTGCTTCCGCCTCCCATTTGGCTGGGA
CGTCGGAGACGGAGGGTCCAAGTTCACTAAGAGGACGAAGGCGGAGGGTAAACCGACCCT 



4081 TTACAGGCACCCGCCACCATGCCCAGCTAATTTTTTGTATTTTTAGTANANACNGGGGTG
AATGTCCGTGGGCGGTGGTACGGGTCGATTAAAAAACATAAAAATCATNTNTGNCCCCAC 



4141 GGGGTGGGGTTCACATGTTGGCCAAGCTGGTCTCGAACTTCTGAACTCAGATGATCCANC 
CCCCACCCCAAGTGTACAACCGGTTCGACCAGAGCTTGAAGACTTGAGTCTACTAGGTNG 



4201 TGCCTCTGCCTCCTAAAATTGCTGGGATTACAGGTGTNANCCACCATGCCCAACTCAAAA
ACGGAGACGGAGGATTTTAACGACCCTAATGTCCACANTNGGTGGTACGGGTTGAGTTTT 

4261 TTTACTCTGTTTANAAACATCTGGGTCTAAGGTAGGAANCTCACCCCACTCAATTTTTGT
AAATGAGACAAATNTTTGTAGACCCAGATTCCATCCTTNGAGTGGGGTGAGTTAAAAACA 
4321 GGTGTTTTTAAGCCAATNANAAAATTTTTTNATGTTC^
CCACAAAAATTCGGTTANTNTTTTAAA 

4381 NNNNNNNNN^ 

4441 nishsinnmsi^^ 

4501 ISnSINNMQNDIISnSlNMJNN]^ 

4561 MNNNisnsnsn^^ 
4621 NisnsnsnsiNNNN^^ 
4681 NNNNisnsnsnsEK^^ 

4741 NNNISDSINNNNNNNNNM^ 

4801 NNisnsnsiOTNi^^ 

4861 NNNNNNMflN^^ 
4921 NNNNNN^^ 

4 981 NNNNNNNNNIS^^ C GGTGNNNGAGGG 

5041 NGCCANGRAGGGGGCCAGGTTCCAANTTCCCAACCKTTTTWGGARGGACNGCCCCCAGGG 
NCGGTNCYTCCCCCGGTCCAAGGTTNAAGGGTTGGMAAAAWCCTYCCTGNCGGGGGTCCC 

5101 GGGGATRAACAGA2STTNGGGGGKGGTWGGGTTNAKGGTGGGAACNCCTTNGCGCCTGGAG 
CCCCTAYTTGTCTNANCCCCCMCCAWCCCAANTMCCACCCTTGNGGAANCGSCGGACCTC 

5161 AACGTGCAAAGAGGAAATGAAGGGCCTGKGTCAAGGAGCCCAAGTNGGCGGGGRAGTTTG 
TTGCACGTTTCTCCTTTACTTCCCGGACMCAGTTCCTCGGGTTCANCCGCCCCYTCAAAC 

5221 CAGGGAGGCACTCCGGGGAGGTCCSGCGTGCCCGTCCAAGGGAGCAATGCGTCCTTCGGG
GTCCCTCCGTGAGGCCCCTCCAGGSCGCACGGGCAGGTTCCCTCGTTACGCAGGAAGCCC 

5281 TTCGTCCCCAWGCCGCGTCTACGCGCCTYCCGTCCTCCCCTTCACGTTCCGGCATTCGTG 
AAGCAGGGGTWCGGCGCAGATGCGCGGARGGCAGGAGGGGAAGTGCAAGGCCGTAAGCAC 

5341 GTGCCCGGAGCCCGACGCCCCGCGTCCGGACCTGGAGGCAGCCCTGGGTCTCCGGATCAG 
CACGGGCCTCGGGCTGCGGGGCGCAGGCCTGGACCTCCGTCGGGACCCAGAGGCCTAGTC 

5401 GCCAGCGGCCAAAGGGTCGCCGCACGCACCTGTTCCCAGGGCCTCCACATCATGGCCCCT 
CGGTCGCCGGTTTCCCAGCGGCGTGCGTGGACAAGGGTCCCGGAGGTGTAGTACCGGGGA 
5461 CCCTCGGGTTACCCCACAGCCTAGGCCGGATTCGACCTCTCTCCGCTGGGGCCCTCGCCT
GGGAGCCCAATGGGGTGTCGGATCCGGCCTAAGCTGGAGAGAGGCGACCCCGGGAGCGGA 



5521 GGCGTCCCTGCACCCTGGGAGCGCGAGCGGCGCGCGGGCGGGGAAGCGCGGCCCATACCC 
CCGCAGGGACGTGGGACCCTCGCGCTCGCCGCGCGCCCGCCCCTTCGCGCCGGGTATGGG 

5581 CCGGGTCCGCCCGGAAGCAGCTGCGCTGTCGGGGCCAGGCCGGGCTCCCAGTGGATTCGC 
GGCCCAGGCGGGCCTTCGTCGACGCGACAGCCCCGGTCCGGCCCGAGGGTCACCTAAGCG 

Topo_II_cleavage_site

5641 GGGCACAGACGCCCAGGACCGCGCTTCCCACGTGGCGGAAGGACTGGGGACCCGGGCACC 
CCCGTGTCTGCGGGTCCTGGCGCGAAGGGTGCACCGCCTTCCTGACCCCTGGGCCCGTGG 



5701 GCAGGACGGGGAAGTGGAAGGTCGAGGCGAAGAAGGCGCGCCTGGGCCGGGGCAGGGCTT



5761 CCCTTCCCAGGTCCCGGCCCAGCCCCTTCCGGGCCCTCCCAGCCCCTCCCCTTCCTTTTC
GGGAAGGGTCCAGGGCCGGGTCGGGGAAGGCCCGGGAGGGTCGGGGAGGGGAAGGAAAAG 



5821 CGCGGCCCCGCCCTCTCCTTCGCGGCGCGAGTTTCAGGCAGCGCTGCGTCCTGCTGCGCA 
GCGCCGGGGCGGGAGAGGAAGCGCCGCGCTCAAAGTCCGTCGCGACGCAGGACGACGCGT 

5860 5875 
ECO47III FSP1



5881 CGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCCGCGCGCTCCCCGCTGCCGAGCCG 
GCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGGCGCGCGAGGGGCGACGGCTCGGC 

5941 TGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTGCGGC 
ACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGACGGCGACCGGTGCAAGCACGCCG 

5953 
FSP1 

6001 GCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGC
CGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCCCCTGGGCCGCCGAAAGGCGCGCG 

6061 TGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGCCCCCT 
ACCACCGGGTCACGGACCACACGCACGGGACCCTGCGTGCCGGCGGGGGGCGGCGGGGGA 
6121 CCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGG
GGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGCCGACCCCAACTCCCGCCGGCCCC 

Topo_II_cleavage_s

NFkB 

Intron1

**********************************^^ + ^^^^^^ :fc ^ + 1(r + :yt + + + + +> 

6181 GGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTGTC 
CCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAGTCCCGCGAAGGGGGCGTCCACAG 

ite 



6241 CTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAA 
GACGGACTTCCTCGACCACCGGGCTCACGACGTCTCCGACACGCTCGCGCCGCGCTTCTT 

6301 CGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTT
GCACGACCGGAAGCCGAAGCGCGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCGGAA 

6361 CACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGG

GTGGTGGTCGCACGCGTCGATGGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTCGCC 

6372 
FSP1 

6421 GGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGTGCTGGTTCACCTGCTGGCACG 
CCGCACCCCCGACGACGACGCGGCGCACCCGCTGCTGCACGACCAAGTGGACGACCGTGC 

6481 CTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCT

GACGCGCGAGAAACACGACCACCGAGGGTCGACGCGGATGGTCCACACGCCCGGCGGCGA 

6541 GTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCCACACGCTAGTGGACCCCGAAG 
CATGGTCGAGCCGCGACGGTGAGTCCGGGCCGGGGGCGGTGTGCGATCACCTGGGGCTTC 

6601 GCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGG 
CGCAGACCCTACGCTTGCCCGGACCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGACCC 

6661 CCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTTGCC 
GGACGGTCGGGGCCCACGCTCCTCCGCGCCCCCGTCACGGTCGGCTTCAGACGGCAACGG 

6721 CAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTC 
GTTCTCCGGGTCCGCACCGCGACGGGGACTCGGCCTCGCCTGCGGGCAACCCGTCCCCAG 

6781 CTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCGTGGTTTCTGTGTGGTGTCACC 
GACCCGGGTGGGCCCGTCCTGCGCACCTGGCTCACTGGCACCAAAGACACACCACAGTGG 

6841 TGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTC 
ACGGTCTGGGCGGCTTCTTCGGTGGAGAAACCTCCCACGCGAGAGACCGTGCGCGGTGAG 

6901 CCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCCATCCACATCGCGGCCACCACG 
GGTGGGTAGGCACCCGGCGGTCGTGGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGGTGC 
6961 TCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTC
AGGGACCCTGTGCGGAACAGGGGGCCACATGCGGCTCTGGTTCGTGAAGGAGATGAGGAG 

7021 AGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAGCTCTCTGAGGCCCAGCCTGAC
TCCGCTGTTCCTCGTCGACGCCGGGAGGAAGGATGAGTCGAGAGACTCCGGGTCGGACTG 

7081 TGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGAC
ACCGCGAGCCTCCGAGCACCTCTGGTAGAAAGACCCAAGGTCCGGGACCTACGGTCCCTG 

7141 TCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCAAATGCGGCCCCTGTTTCTGGA 
AGGGGCGTCCAACGGGGCGGACGGGGTCGCGATGACCGTTTACGCCGGGGACAAAGACCT 

7167 

ECO47III

7201 GCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCT 
CGACGAACCCTTGGTGCGCGTCACGGGGATGCCCCACGAGGAGTTCTGCGTGACGGGCGA 

7261 GCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCGGGAGAAGCCCCAGGGCTCTGT 
CGCTCGACGCCAGTGGGGTCGTCGGCCACAGACACGGGCCCTCTTCGGGGTCCCGAGACA 

7321 GGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCA 
CCGCCGGGGGCTCCTCCTCCTGTGTCTGGGGGCAGCGGACCACGTCGACGAGGCGGTCGT 

7381 CAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCC
GTCGTCGGGGACCGTCCACATGCCGAAGCACGCCCGGACGGACGCGGCCGACCACGGGGG 

7441 AGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCAT 
TCCGGAGACCCCGAGGTCCGTGTTGCTTGCGGCGAAGGAGTCCTTGTGGTTCTTCAAGTA 

7501 CTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCTGACGTGGAAGATGAGCGTGCG
GAGGGACCCCTTCGTACGGTTCGAGAGCGACGTCCTCGACTGCACCTTCTACTCGCACGC 

7561 GGACTGCGCTTGGCTGCGCAGGAGCCCAGGTGAGGAGGTGGTGGCCGTCGAGGGCCCAGG
CCTGACGCGAACCGACGCGTCCTCGGGTCCACTCCTCCACCACCGGCAGCTCCCGGGTCC 

7575 
FSP1 



7621 CCCCAGAGCTGAATGCAGTAGGGGCTCAGAAAAGGGGGCAGGCAGAGCCCTGGTCCTCCT 
GGGGTCTCGACTTACGTCATCCCCGAGTCTTTTCCCCCGTCCGTCTCGGGACCAGGAGGA 



7681 GTCTCCATCGTCACGTGGGCACACGTGGCTTTTCGCTCAGGACGTCGAGTGGACACGGTG 
CAGAGGTAGCAGTGCACCCGTGTGCACCGAAAAGCGAGTCCTGCAGCTCACCTGTGCCAC 



7741 ATCGAGGTCGACTCTAGAGGATCCCCGGGTACCGAGCTCGAATTCGTAATCATGGTCATA 
TAGCTCCAGCTGAGATCTCCTAGGGGCCCATGGCTCGAGCTTAAGCATTAGTACCAGTAT 

7747 
SAL1 



gccaagttcctgcactggctgatgagtgtgtacgtcgtcgagctgctcaggtctttcttt 
tatgtcacggagaccacgtttcaaaagaacaggctctttttctaccggaagagtgtctgg 
agcaagttgcaaagcattggaatcagacagcacttgaagagggtgcagctgcgggacgtg 
tcggaagcagaggtcaggcagcatcgggaagccaggcccgccctgctgacgtccagactc 
cgcttcatccccaagcctgacgggctgcggccgattgtgaacatggactacgtcgtggga 
gccagaacgttccgcagagaaaagagggccgagcgtctcacctcgagggtgaaggcactg 
ttcagcgtgctcaactacgagcgggcgcg 



FIG. 23 



TCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAG 
ACCAGCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCC 
AGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATC 
AGGGGCAAGTC 